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Abstract. Current evidence does not favor the existence of the + pentaquark, which was reported by 
several groups in the years after 2002. The question naturally arises: how could many groups could have 
seen fluctuations in their data at the level of 3-5 a statistical significance? An example of a statistical 
fluctuation is given and the number of cr's necessary for claims of discovery are examined. Using this 
guideline, a possible answer to the above question is presented. 

PACS. 12.39.Mk Multi-quark/gluon states - 13.60.Rj Baryon production 



1 Introduction 

The & + pentaquark, if it exists, has a predicted pQ quark 
structure of ududs and a narrow width. Recent theoretical 
work [5] , as an alternative to Ref . [T] , gives a modern esti- 
mate of its width. The question of whether the <9 + exists 
or not is an important one for quantum chromodynam- 
ics (QCD). In particular, if the & + were to exist, then 
non-perturbative QCD would require quark-quark corre- 
lations (or some other mechanism) that would prevent the 
+ from quickly falling apart into lighter components (a 
kaon plus a nucleon) [3]. Hence, the simplest quark model 
predicts pentaquarks with a wide width, which would be 
difficult to observe in experiments. 

The observation of a potentially narrow pentaquark by 
the LEPS Collaboration [J at a mass of 1540 MeV was 
initially greeted with some skepticism. If true, this would 
suggest that the color-magnetic attraction between quarks 
could be stronger than previously thought [5]. Other exper- 
iments set out to either confirm or deny the existence of 
the <9 + using data already available. Experiments from 
DIANA [6], CLAS [7|[8] and SAPHIR [9] found positive 
evidence for the 6> + , seeming to confirm the LEPS result. 
Other positive results followed within the year, as well 
as null results (for a review, see Ref. [E]). The conflicting 
positive and null evidence were confusing, and clearly new 
experiment with better statistics were needed to provide 
answers. 

Next, the statistical significance of the positive evi- 
dence for the (9 + is reviewed. Then data with higher statis- 
tics will be examined, and the possibility of repeating 
some previous experiments with more statistics will be 
discussed. 
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2 Statistical Significance 

Soon after evidence for the + was published, inconsisten- 
cies in the mass measured by different experimental groups 
were noticed. In addition, the statistical significance of 
the <9 + peaks was questioned, due to uncertainties in the 
shape of the background under the "peaks" . A nice review 
of these uncertainties was presented by Pochodzalla |llj . 
where the data was plotted with error bars and without 
fitted curves. The reader is encouraged to read Ref. [11] 
for more details. 

Here, I will take a slightly different approach. My goal 
is to determine what level of statistical significance is suf- 
ficient before one can reasonably claim that a new particle 
has been seen. The simplest case to study is a Monte Carlo 
spectrum generated with a flat background, as shown in 
Pig- HI Your eye is drawn to the point in the middle, which 
is unusually high. What is the statistical significance of 
this "peak"? This question is not well formed, so let us 
examine two ways to ask a better question. 

First, we will assume that the "peak" in Fig. Q] is a 
fluctuation of the uniform background. A fit to the back- 
ground gives a value of B = 15.0 ± 0.4 counts. The signal 
above background is S = 36 — 15 = 21 counts. The sig- 
nificance of the background fluctuation is [T^] , in units of 
standard deviations: 

S/VB + V 

where V is the variance in the background. In this case, 
the variance V is small, and the significance is, to good ap- 
proximation, 21/4 ~ 5cr. In other words, the background 
has a standard deviation, as — VB of ±4 counts, and the 
fluctuation of the "peak" is five times this standard devia- 
tion. The probability of a 5cr fluctuation is about 5 x 10~ 5 , 
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Fig. 1. Example of a Monte Carlo spectrum with a flat distri- 
bution of the spectrum and a mean value of 15 counts. 

or about once per half-million cases. Indeed, this is easy 
to verify by Monte Carlo, by generating a million spectra 
and seeing how many times the "peak" exceeds 20 counts 
above background. Of course, the fluctuation can happen 
anywhere in the spectrum, which has 100 channels, so the 
probability of a 5a fluctuation anywhere in the spectrum 
is only once per 200 cases. 

The above discussion is rather pedantic, but for a rea- 
son. There are some publications in the + literature 
where the statistical significance is not properly calcu- 
lated. The primary reason is the denominator in the above 
equation, B + V. In the Monte Carlo example, the value 
of B + V was clear, but in real data the shape of the back- 
ground is often uncertain. So the variance can be large. 

Second, let us assume that the "peak" is a real signal. 
In this case, the Monte Carlo generator must be modified, 
so that the background is generated independently from 
the peak. In this case, the statistical significance is de- 
termined by the uncertainties in both the signal and the 
background. The uncertainty in the background is as — 4 
counts. The uncertainty in the signal is as = \/21 = 4.5 
counts. Adding these in quadrature gives a$+B = 6.0 
counts. Now the statistical significance is 21/6 = 3.5<r. 
This roughly agrees with an "eyeball" estimate from Fig. 
[TJ where the error bar on the "peak" is about ±6 counts, 
and the backgound is lower by the length of about 3 times 
the error bar. However, the probability of a 3.5<7 fluctua- 
tion is about one chance out of 1000. In other words, the 
probability of a real signal to fluctuate down into the level 
of the background is different than the probability of the 
background to fluctuate up, as shown in Fig. [1] 

The main point of this discussion is that your eyes can 
be deceiving when estimating statistical significance, and 



one must rely on solid statistical analysis to determine the 
significance of a signal. Perhaps the best way to calculate 
the significance is: (1) fit the shape of the background 
without the region of the "peak" , (2) use a Monte Carlo 
to generate random fluctuations around the background 
shape, (3) determine the number of times an excess is 
found in the "peak" region, (4) calculate the probability 
of a fluctuation of the given background, (5) repeat this 
for several reasonable variations of the background. By 
applying this procedure to some of the (9 + data, it is easy 
to see that a 5a signal can easily become a 3er signal for 
a different background shape (for example, see Ref. [13]). 

Of course, there are other complications in addition to 
those mentioned above. Arbitrary cuts in the data analysis 
should be avoided at all costs. Each analysis cut should be 
justified independent of its effect on the final mass spec- 
trum. Pessimistic estimates of the background should be 
shown in addition to the best-fit background shape. The 
data should also be examined with different binning, or al- 
ternatively use the log-likelihood analysis that is described 
in many textbooks. These techniques were used in some, 
but not all, of the + publications. The bottom line is 
that it takes time and effort to do a good statistical anal- 
ysis, and even then it may result in an estimate of the 
statistical significance of a peak that is too optimistic. 

So how many er's are needed to claim discovery? This 
question has been discussed by Ref. p~2] , where they argue 
that a 5a peak is not so uncommon, especially if there are 
several cuts (or event selections) applied to the data. In 
addition, an experimentalist may look at hundreds of spec- 
tra in the course of an analysis, and each spectrum may 
have hundreds of channels. As a result, I suggest that a 
statistical significance of la is a reasonable baseline for 
claims of discovery. Even so, a single instance of la is not 
convincing, and confirmation by an independent experi- 
ment at the same level of statistical significance is desired. 
This latter point is emphasized in the + review by the 
Particle Data Group [14]. 

In summary, a few results of only 4-5cr are not enough 
to claim discovery. Few people would be convinced that 
the "peak" in Fig. 1 is real, unless it could be reproduced 
with higher statistics. 

3 Review of <9 + Experiments 

The first five publications showing positive evidence for 
the + are given in column 1 of Table [1] The collabora- 
tions are: LEPS 0], Diana [6], CLAS [7], SAPHIR [9| and 
HERMES [15] . The quoted statistical significance in these 
papers is about 4-5 a, although the uncertainties in the 
background under the peaks suggests that the statistical 
significance should have been smaller. Each of these ex- 
periments has been repeated, sometimes by other groups, 
although the experimental conditions were not reproduced 
exactly. The "repeat" experiments are listed in column 4 
of Table [Q 

The LEPS collaboration repeated their earlier exper- 
iment, except using a deuterium target rather than Car- 
bon. Because the liquid deuterium target is longer than 
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the compact carbon target, one of the analysis cuts from 
the original analysis 4J (using vertex tracking to reduce 
background) was no longer possible. However, the statis- 
tics in the K~ missing mass spectrum was more than 5 
times higher. The resulting spectrum has been published 
only in a conference proceeding [16] and shows a peak at 
the same mass as before. The statistical significance of this 
signal was not reported, but is likely in the range of 3-5 a. 
The LEPS group decided to take more data on deuterium 
before publishing the results. 

It is perhaps worth noting that the LEPS result is 
the only one of the original five publications where the 
+ peak has been seen in the repeated measurement. Ei- 
ther this means that the + is very peculiar, requiring a 
special kinematic range to be produced, or some other ex- 
planation is necessary, such as a kinematic reflection from 
another reaction. Kinematic reflections have been inves- 
tigated by the LEPS group, but there is no obvious way 
that this could happen. Other explanations, such as ex- 
cessive data cuts, do not make sense because only basic 
cuts (such as particle ID and <fi exclusion) are applied, and 
variations of the cut limits do not erase the peak. How- 
ever, if the + is not seen elsewhere then the LEPS data 
alone are not sufficient to claim discovery. 

The DIANA results [6] are still viable, but just barely. 
These data are from old bubble chamber experiments us- 
ing a K + beam on Xenon. Cuts arc applied to reduce the 
background from kaon charge exchange. Their results were 
not reproduced directly, but are severely limited by anal- 
ysis from the Belle Collaboration. For the Belle results, 
a kaon was tagged from D-meson decay, which interacts 
with Silicon in their vertex detector, followed by detec- 
tion of pK° and pK + pairs. They estimate and subtract 
events from kaon charge exchange. The resulting mass 
spectrum does not show any + peak. Belle's upper limit 
for + production is below that calculated from the DI- 
ANA experiment, but still within one standard deviation 
of the DIANA result. 

While the Belle result does not entirely rule out the 
DIANA result, it puts a severe limit on the possible width 
of the + at less than 1 MeV. Such a narrow width is 
difficult to reconcile with theoretical estimates [T7], but 
still on the edge of possibility [2, 18]. At the very least, 
the DIANA result should be regarded with great caution. 

The original CLAS result used older data (called g2a) 
from deuterium that was analyzed quickly after the LEPS 
result was announced. The reaction is exclusive, where all 
momentum and energy is accounted for in the final state. 
However, this could only be done if the proton, which 
would normally be a spectator, was kicked out of the tar- 
get due to a two-step process. By studying the mirror 
reaction that produces the 4(1520), via the — > K + A*n 
reaction, two-step reactions accounted for about half the 
photoproduction cross section there. The two-step mech- 
anism, required to detect the proton in the + reaction 
has both good and bad aspects. The good side is that an 
exclusive reaction can be measured. The bad side is that 
the amplitude for this reaction is difficult to calculate. 
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Fig. 2. Comparison of the original CLAS data [7] (points with 
error bars) and the repeat measurement [13] (solid histogram, 
scaled down by a factor of 5.92). 



The repeat of the CLAS result is shown in Fig.[2]by the 
solid histogram (which has been rescaled down by a factor 
of 5.92) overlayed on top of the original "g2a" data. In this 
comparison, the photon energy range was constrained to 
be the same in both analyses. The new result shows that 
the shape of the background differs from the one expected 
from the g2a data. In particular, the g2a points on ei- 
ther side of the "peak" drop below the histogram. Using 
the new data, the g2a "peak" is about a 3a fluctuation 
[T5] , However, the complication of the two-step reaction 
prevents the new data from placing strong limits on the 
fundamental (one-step) + cross section. An upper limit 
of 3 nb on the total cross section was found [T3] , but this 
number depends on the rescattering model used. 

The SAPHIR Collaboration were the first to publish a 
+ search using the jp — > K + K°n reaction [9]. Using a 
proton target essentially eliminates two-step reactions and 
hence this measurement would be more convincing if the 
+ were seen. The SAPHIR experiment used a kinematic 
fit to determine the particle identification (PID). Because 
their detector had a short path for time-of- flight, detection 
of kaons was difficult. However, they saw a clear peak for 
the .4(1520), showing that their PID was good enough for 
this purpose. After applying a forward-angle cut on the 
K°'s, they found a + peak with a cross section that was 
a substantial fraction of the 4(1520) cross section. 

One year later, the CLAS Collaboration measured the 
same reaction as SAPHIR, but with more than 10 times 
higher statistics [19] . The resulting + mass spectrum was 
completely void of peaks, even when SAPHIR's data cuts 
and photon energy range were applied. An upper limit of 
1 nb was found for the total cross section, which is over 
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100 times smaller than the yl(1520) cross section. Clearly, 
the SAPHIR result could not be reproduced. 

The HERMES Collaboration [TS] used high-energy e+ 
on a fixed deuteron target to produce events with a K° 
and a proton. The invariant mass spectrum for K°p pairs 
showed a peak near 1528 MeV, about 10 MeV lower than 
the + peak from other experiments. Their analysis was 
admirable in working to quantify the background under 
their peak, and a good job in estimating the statistical 
significance. 

The BaBar experiment, using colliding e + e~ beams, 
had some beam halo hit the Berilium beam pipe in their 
detector. The resulting luminosity was large, and hence 
they had data for high-energy e + on a fixed Be target 
with high statistics. Their results have not yet been pub- 
lished, but were reported by Dunwoodie [20j where pre- 
liminary results showed a smooth spectrum without any 
+ peak. Although there are questions in comparing the 
BaBar results to those of HERMES, it seems that again 
the + could not be reproduced. 

Having seen that the + could not be reproduced in 
four out of five attempts to confirm the earlier evidence in 
independent experiments, it only makes sense to be highly 
suspicious of the existence of the + . Does this mean that 
experimentalists should stop searching for pentaquarks? 
My answer is no, for the reasons given in the next section. 
But clearly mistakes were made in the statistical signif- 
icance estimates of some early searches and extra effort 
is necessary to guard against future over-optimistic esti- 
mates. 



4 Pending Searches 

There are several pentaquark searches that have positive 
results with high statistical significance, but have not yet 
been repeated. These experiments are listed in Table[2]for 
the collaborations ITEP [21], ZEUS [22], CLAS [8] and 
COSY/TOF [23J. Two of these (ZEUS and COSY) have 
new data with higher statistics, but no results yet. The one 
with the highest estimated significance (CLAS) requires 
a higher energy beam than Jefferson Lab can presently 
extract, but will likely be repeated in 2008 (as part of a 
run that will also measure meson production). 

The logic of why the pentaquark searches should be 
continued is obvious. If the pentaquark exists, then this 
has important consequences for non-perturbative QCD. 
The & + has not been seen in some experiments, but to 
conclude that the + will not be seen in any experiment 
is illogical. Having said that, it is also true that it is un- 
likely that the + peaks seen in the experiments of Table 
[2] will be reproducible, based on the irreproducibility of 
several experiments in Table Q] However, if the physics is 
important, then the data should not be left in this con- 
fused state. The experiments should be repeated and ei- 
ther confirmed or rejected. 

Since no new results are yet available for the exper- 
iments listed in Table [21 let us take a closer look at the 
published results. The ITEP group [H] analyzed neutrino- 
nucleus interactions in bubble chamber data, and formed 



the invariant mass for K p pairs. Looking at their data, 
I estimate about 17 counts in the peak on top of a back- 
ground of about 15 counts. Using the equation above, with 
V = 0, gives 4.4cr, far below the estimate of 6.7a claimed 
in their paper. The significance is likely lower, since the 
variance is certainly bigger than zero. 

The ZEUS results [22] require a cut on Q 2 > 20 GeV 
to see a peak. It is not clear why this kinematic cut is 
necessary, but from analysis of other known resonances, 
the ZEUS group claims that better resolution is obtained 
in their detector when this cut is used. However, a bigger 
question is why other high-energy experiments such as 
CDF, HERA-B or FOCUS (see Ref. for a summary) 
do not see any + peak in their data? If the ZEUS result is 
correct, then this argues that the + can be produced by 
fragmentation, which would be seen in many high-energy 
experiments. This suggests that the peak seen by ZEUS 
may not be the + but some other feature of their data. 

The CLAS results [8] in Table [2 are different from the 
two CLAS experiments in Table [1] The former measured 
the 7r + K~ K + p final state, with a cut on ir + backward of 
about 37° in the center-of-mass frame. Without this cut, 
the + peak cannot be seen. See Ref. [8] for a justification 
of this cut. Looking at their data, I estimate a statistical 
significance of 4-5<7 rather than the 7-8<7 given in Ref. [8] . 
This is still a significant result, and it will be interesting 
to see whether their peak is reproducible when they take 
more data next year. 

The COSY result is perhaps the most intriguing of the 
results in Table [51 It is the only one where a hadronic 
proton-proton reaction is used. This has an advantage as 
there is almost no background, since the reaction is so 
close to threshold. Although some people have questioned 
their particle identification scheme, the confusion is likely 
due to the fact that they use a purely geometric technique, 
which is different from that used by many other experi- 
ments. A close examination of their data analysis has con- 
vinced me that it is reasonable. The key thing about the 
COSY data is that they have a new data set with more 
than five times higher statistics. If they can reproduce 
their + peak, then this will generate a lot of interest 
(and a lot of questions) in the nuclear physics community. 

I have not mentioned all experiments with evidence for 
(or against) the <9 + , but a comprehensive review is beyond 
the scope of this article. Instead, I have chosen to mention 
only experiments that I know have been repeated or are 
likely to be repeated in the near future. 



5 Theoretical Developments 

With the new negative evidence from the "repeat" exper- 
iments in Table [H the question arises as to whether the 
+ is still theoretically viable. In other words, is there a 
theory that is consistent with the negative evidence and 
also the potentially positive results (from LEPS)? The the- 
ory calculation by Nam, Hosaka and Kim [24] answers 
in the affirmative, provided that the + has spin-parity 
r=2,/2+. 
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In Ref. [24] , they argue that for + production, the s- 
and u-channel diagrams are suppressed compared with the 
i-channel and the contact term. They also note that the 
contact term is present only for production on the neutron. 
For J = 3/2 in their model, they find that the contact 
term dominates by a factor of 25-50 over K* exchange 
in the i-channel. Hence it is possible to see the + at 
LEPS (for production from the neutron) and not see it 
for the SAPHIR reaction (using a proton target), provided 
the width of the + is 1 MeV or less. Furthermore, their 
calculation for J 11 = 3/2+ gives almost all of the cross 
section at forward kaon angles, where the CLAS data [7] 
has a hole in the acceptance. 

Although the above model is consistent with the nega- 
tive evidence from CLAS (both proton [19] and deuteron 
[T5] data) , it still does not explain all of the negative ev- 
idence from high-energy experiments. If the + exists, 
then it would be necessary for its production to be strongly 
suppressed in fragmentation processes. Using QCD string 
theory, one model [25] shows that it is very difficult to 
produce the Q + in fragmentation. 

With advances in lattice gauge theory, one can also 
ask whether lattice calculations see any hint of the + . 
The answer is yes, but only for spin-parity J n = 3/2 + . The 
evidence is presented in Ref. [26] (see Fig. 9 of their paper). 
The resonance signature, which they have seen for other 
known baryon resonances, is for the mass to drop below 
that of the scattering states, which is seen only for J 71 " = 
3/2 + . The authors also show that there is no resonance 
signal for J n = l/2 ± or for 3/2 _ . 

It is worth mentioning that there is one other lattice 
result for J = 3/2, done by a Japanese group [27] • Their 
result has fairly large effective quark masses, and their ef- 
fective mass plots for positive parity have a strange shape 
(see Fig. 3 of Ref. |27j ) and so it is not clear whether their 
null results are more valid than those of Ref. [26] . 

The bottom line is that it is still possible for the + to 
be consistent with all existing data, provided it has J n = 
3/2 + and a narrow width. This would make the + a 
very peculiar particle, but not yet outside the realm of 
possibility. 



6 Summary 

Using a Monte Carlo spectrum as an example, we see that 
a 4cr peak is not very convincing. Furthermore, several 
4-5cr results should not be taken too seriously. It was sug- 
gested that two independent results of la or more is the 
threshold where discovery can be claimed. Experimental- 
ists should aspire to making pessimistic, not optimistic, 
estimates of the statistical significance of a mass peak. 

Of the first five publications showing positive evidence 
for the + , only one (LEPS) has been able to reproduce 
a peak in a "repeat" measurement. However, the second 
LEPS result has been presented only at conferences, and 
has not been formally published. The bottom line is that 
the + has not stood up to the scrutiny of higher-statistics 
measurements. Several other + experiments have yet to 
be repeated. 



The outlook for the + looks bleak, but this does not 
mean that we should abandon pentaquark searches. At 
present, the data are still contradictory, and experimen- 
tal facilities should endeavor to pursue the outstanding 
claims and put this question (whether the + exists or 
not) to rest. I suggest that we be patient yet cautious as 
the experiments make progress. 
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Table 1. The first five positive (9 publications and the results of repeat measurements with higher statistics. The column 
labeled a show the statistical significance quoted in the publication. The column labeled "Increase" shows the factor by which 
the number of counts in the mass spectrum increased. 



Original Experiment 



Repeat Measurement 



Group 



Reaction 



cr's I Group 



Reaction 



Increase 



Result 



LEPS -yC -> K+K~X ~ 4 

DIANA K+Xe -> K°pX ~ 4 

CLAS -yd -> K+K-pn ~ 5 

SAPHIR 7p -> K+K-n ~ 5 

HERMES e+d -> if °pA ~ 4 



LEPS 

Belle 

CLAS 

CLAS 

BaBar 



K+K-X 
K+Si K°pX 
'yd — » K + K~pn 

e+Be -» 



- 5 
~ 10 

> 6 

> 10 

> 100 



~ 3-5cr 

r e+ < 1 MeV 
<7tot < 3 nb 
(Ttot < 1 nb 
No e+seen 



Table 2. Publications with positive evidence for the + that have not yet been repeated with higher statistics. As before, the 
statistical significance (cr's) is that given in the published paper. (See the text for references.) 



Group 



Reaction 



cr's Comment 



ITEP vA -» P K°X 6-7 

ZEUS e+p -> pK°X ~4.5 

CLAS TP -f Tr+if-if+n 7-8 
COSY pp-*E-K a p ~5 



NOMAD i/ experiment also sees a small peak 

new data with improved vertex detector is being analyzed 

needs 5.5 GeV beam; might be rerun in 2008? 

new data with 5 times higher statistics is being analyzed 



